Ranking knots of random, globular polymer rings 
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An analysis of extensive simulations of interacting self-avoiding polygons on cubic lattice shows 
that the frequencies of different knots realized in a random, collapsed polymer ring decrease as a 
negative power of the ranking order, and suggests that the total number of different knots realized 
grows exponentially with the chain length. Relative frequencies of specific knots converge to definite 
values because the free energy per monomer, and its leading finite size corrections, do not depend on 
the ring topology, while a subleading correction only depends on the crossing number of the knots. 
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Issues related to the probability of realization of con- 
figurations with specific knots in closed random chains 
play a major role in topological polymer statistics [D, 0] 
and in its applications to macromolecular and biological 
physics . Interest in the spectrum of different knots re- 
alized in random polymer models is stimulated, e.g., by 
the need of comparison with the circular DNA extracted 
from some viral capsids [1, [f|, in the hope to identify 
specific biological mechanisms of knot formation. The 
low number of globular proteins for which a knot has 
been detected in the native state 0, Q marks a striking 
difference with respect to the general collapsed phase of 
homopolymers, in which a definitely higher knotting fre- 
quency is expected. Understanding the reasons of such a 
difference is certainly a key issue for the formulation of 
adequate statistical models of proteins 0. The interest 
in collapsed polymers is also stimulated by the recent re- 
alization that their knots, unlike prime knots in the good 
solvent case, are on average completely delocalized along 
the backbone @|. Understanding if and up to what ex- 
tent topological invariants can affect the globular state 
in such conditions is an intriguing fundamental issue. Al- 
ready in the swollen regime, the problem of precisely de- 
termining the possible dependence on topology of the free 
energy per monomer and of the exponent specifying its 
correction cx In N/N in the limit where the number N 
of monomers approaches infinity, remains open 1^, 11 1. 
This in spite of the fact that the localized character of 
prime knots should represent a simplifying feature. In the 
globular state similar issues have never been addressed 
and their discussion should include the free energy cor- 
rection associated to the existence of the globule-solvent 
interface 12, 13j |. 

In the present Letter we investigate the different 
topologies realized by the equilibrium configurations of 
a collapsed ring polymer. In spite of the considerable 
complexity of these configurations, we show that a rela- 
tively simple statistical law governs the frequencies of the 
various knots, with far reaching consequences. On one 
side, it allows to argue the rate at which the amplitude 
of the spectrum of different knot topologies grows with 



increasing ring length. At the same time, the emerging 
scenario confirms that knots are delocalized and clarifies 
how topology controls the statistics of the globular state. 

A cornerstone in topological polymer statistics has 
been the realization that, for self-avoiding polygons 
(SAP's), unknotted configurations are entropically dis- 
favored, so that their probability a ppr oaches zero expo- 
nentially with growing chain length Tl|,0, EH- However, 
after this important step, progress in the statistical anal- 
ysis of knot complexity was hindered by the circumstance 
that unknotted configurations in swollen, good solvent 
regimes remain overwhelmingly dominant even for rela- 
tively very long chains. On the other hand, for polymer 
rings in bad solvent, the collapsed state leads to expect 
a much higher probability of knotting compared to the 
swollen case with the same chain length. This circum- 
stance suggests collapsed polymers as ideal systems for 
the study of how topological complexity develops and 
statistically distributes itself with growing chain length. 

As a model of the large scale behavior of a long flexi- 
ble polymer in a solvent we adopt the self-avoiding walk 
on cubic lattice. Attractive energies (e = —1) between 
non-consecutive nearest neighbor visited sites allow us 
to work in the collapsed state at T < T@, where Tq is 
the theta-collapse temperature [lij]. In such regime it is 
rather difficult to sample a sufficient number of uncor- 
related polymer configurations. We use the pruned en- 
riched Rosenbluth method [13] (PERM) which revealed 
very effective and was successfully used also for the search 
of native states of protein models [HI]. Our computa- 
tional problem is made heavier by the fact that the closed 
chain configurations, i.e. the SAP's generated by PERM 
are only a small fraction of the total. A more serious dif- 
ficulty is the topological analysis of the knot type present 
in a given closed chain configuration. The configurations 
of long, collapsed SAP's are very intricate geometrically 
and their projections on planes present a huge number 
of crossings. This makes impossible the calculation of 
the topological invariants necessary for the knot identi- 
fication. In order to circumvent this difficulty, we sim- 
plify each sampled configuration before performing the 



2 



0.1 r 



0.01, 




2000 



FIG. 1: Probability of finding an unknot. The straight dashed 
line represents the exponential fit with iVo = 420. 



analysis of invariants. To this purpose we apply to the 
configuration a smoothing algorithm, which progressively 
reduces the length of the chain, while keepingits topol- 
ogy unaltered (for a similar procedure, see [5|). This 
algorithm is in fact a BFACF grand-canonical simula- 
tion [1 91 ] in which the step fugacity is fixed low enough 
to cause a rapid reduction of step number in the SAP to 
be analyzed. Projections of such shrunk SAP are then 
analyzed by the HOMFLY polynomial in "Knotscape" 
program [20(. Our code allows us to resolve prime knots 
up to 11 essential crossings, and composite knots up to 5 
components. 

Along the above lines we perform a systematic analysis 
of the knot spectrum for TV-step SAP's at T = 2.5 < T B 
(T e = 3.595 HU), for chain lengths up to N = 1800. 
A first result concerns the behavior of the probability 
Pq(N) of configurations with knot k = 0, where in- 
dicates the unknot, as a function of N. Our procedure 
allows the classification of almost all configurations for N 
up to 800, while from N = 1000 to N = 1800 we have a 
progressive degradation of performances, reaching a 59% 
of unresolved configuration for N = 1800. There is thus 
an uncertainty on the normalization needed to calculate 
probabilities like Pq,(N). The most plausible scenario is 
that unresolved configurations have complex knots, espe- 
cially prime knots with more than 11 crossings or com- 
posite ones with more than 5 prime components. They 
enter thus in the statistics as "unresolved". We see an 
exponential decay Pg ~ e -N/N ^ f or i ncreasm g jy_ The 
decay constant is No — 420 (see Fig. Q]). This value is 
two orders of magnitude smaller than that expected for 
T = oo (N w 2.4 x 10 5 QUI]), when SAP statistics is 
controlled by excluded volume alone. This small Nq in- 
dicates that configurations with knots have appreciable 
probability already for relatively short SAP's. In a re- 
cent work on Hamiltonian loops on the cubic lattice 22 1 
a value of No < 196 was estimated, about half the one we 
determine at T — 2.5. If we think of Hamiltonian loops 
as a (T = 0)-like situation, a lower value of Nq should 
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FIG. 2: Probability of rank-ordered knots for N=600. Curves 
are for sample sizes increasing as a power of 2, from 10 4 to 
64 x 10 4 . Some simple knot topologies (unknot, 3i, 4i, 52, and 
5i) and a more complicated one (6i#10i28) are also associated 
with their ranking. 



indeed be expected if the trend of No decreasing with T 
is general. 

A further step was the analysis of the knot type k of 
each sampled configuration and of its probability Pk(N) 
in the statistics. We rank in decreasing order these prob- 
abilities and, by simply indicating as P q the probability 
of the knot k with rank q (q = 1,2,3,...), we obtain 
the log-log plots reported in Fig. [2] These plots corre- 
spond to increasing numbers of sampled configurations 
for N = 600. All curves display the same slope and over- 
lap in the first part. They only differ for the cutoff: the 
richer the sample, the larger the maximum rank of the re- 
alized knots. At the same time, upon varying sample size, 
we observe stability of the rank ordering of the knots cor- 
responding to the initial linear parts of the plots. Thus, 
the estimates of Pfc(600) should not be affected by sys- 
tematic errors. All this means that the observed cutoff in 
rank is only due to limited sampling and has nothing to 
do with the presumably much higher cutoff on the spec- 
trum of different realizable knots due to the fact that the 
SAP length is finite. The power law behavior shown by 
all the plots is therefore a robust feature of the data in 
this regime. 

Figure [3] shows similar plots, this time for varying N 
and with quite rich samples. For clarity the effects of the 
statistical cutoffs are not shown, and probabilities are 
divided by the unknot probability Pq>(N). The slope of 
the plots is an increasing function of N. The estimates of 
the slope extrapolate to a value —0.61(3) (see Fig. 3) for 
N — ► oo. Thus, for large N the rank ordering statistics 
of the knots obeys a law of the Zipf type: 



P 9 (oc)/P (oo) 



(1) 



with r 0.6. The Zipf law was first observed in the con- 
text of linguistics [23j, where it rules the rank ordering 
in frequency of different words in a text. Since then it 
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FIG. 3: (a) Power-law ranges of P q {N) / P®(N) , for N = 200, 
400, 500, 600, 700, 800, 1000, 1200, 1400, 1600, and 1800. For 
N — 1800 also the cutoff, due to limited sampling, is shown, 
(b) Extrapolation of the exponent of the Zipf law for N — > oo. 
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FIG. 4: (Color online) Probability of a knot type k over the 
probability of an unknot, vs 1/iV, for some knots. Each ratio 
converges towards an asymptotic value Aj, /A$ . Straight lines 
are fits ~ exp[— (S k — 5q)/N]. 



emerged in different fields, ranging from economics to dis- 
ordered systems |2J] . For sure the validity of such simple 
law for knots in random polymers reveals a remarkable 
and unsuspected degree of organization of such form of 
topological complexity. 

The asymptotic validity of Eq. ([I]) with exponent r < 1 
allows us to infer a fundamental property of the knot 
spectrum, which at first sight would seem unaccessible to 
any numerical investigation. As mentioned above, there 
must be a cutoff q max (N) for the Zipf law (JT|). Even 
if the statistical cutoff due to finite sampling occurs at 
much lower q's, the dependence of q max {N) on N is a key 
information on the spectrum, providing a lower bound 
for the maximum number of different knot topologies a 
SAP of length N can host. Assuming validity of the Zipf 
behavior in Eq. ([1]) up to q — q m ax(N), in view of the 
normalization condition ^2 q P q (N) = 1 valid for any N, 
one easily concludes that 
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Thus, q max (N) ~ P^N)- 1 ^ 1 '^ ~ e N /° AN °. This means 
that the number of different possible topologies a SAP 
can host grows (at least) exponentially with N. 

The validity of the law in Eq. ([TJ) also for the asymp- 
totic ranking of knots is induced by some balance in the 
knot frequencies, which should not diverge relative to 
each other for N — » oo. For a collapsed globule in equi- 
librium, theoretical arguments and numerical results sug- 
gest the following form for the large N behavior of the 
canonical partition function [p3 |: 
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(2) 



spectively, and a is an exponent [161] . The stretched ex- 
ponential factor containing k\ clearly implies an interfa- 
cial contribution to the free energy, since the area of the 
globule-solvent interface is expected to grow ~ iV 2 / 3 . It 
is natural to expect an asymptotic behavior of the form 
in Eq. ([2]) also for ensembles with fixed knot topology like 
those considered here. The value of a could depend on 
the topology of the globule in such ensembles. Indeed, in 
the T — > oo case, there are indications that the analog of 
a for prime knots differs sensibly from that for composite 
knots 10] (as confirmed in 11 1). Dependences on topol- 



where F is the free energy, n and ki are reduced free 
energies per monomer and per interface monomer, re- 



ogy could not be excluded, a priori, also for parameters 
like k and n\. Indeed, the equivalent of k in the T — > oo 
limit for the unknotted ring is rigorously known to be 
different from the n in the ensemble with unrestricted 
topology On the other hand, the n of the unknotted 
ring has been conjectured to be the same as that of any 
other knotted ring for non- interacting SAP's fioj ] . 

An analysis of ratios / Pg should reveal the possible 
dependence of k, k\, a, and A on the knot type. These 
ratios are shown in Fig. [4] as a function of 1/N for the 
simplest prime and composite knots (25l |. They do not 
diverge. This implies that for the analyzed different knots 
in the collapsed polymer ring the parameters n, k%, and 
a must be the same. In particular, unlike in the T —> oo 
case, the entropic a exponent should be the same for all 
knots. On the other hand, Fig. [4] reveals that one should 
include a knot-dependent subleading factor exp(— Sk/N) 
in the form of the partition function @. The differences 
5k — St$ are proportional to the slopes of data sets in log- 
scale in Fig. |H These have values that essentially are 
determined by the crossing number n c of the knots (see 
Fig. [5|). For example, 5s 1 #3 1 ~ 5q 1 , and so on. A simple 
power-law increase 8k — &% ~ nj' 45 fits rather well the 
data in Fig. [5] 

We stress that for swollen polymers there are clear nu- 
merical evidences [13, U] of the conjecture that ctk = 
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FIG. 5: (Color online) Fitted 8k — 5$ as a function of the 
minimal crossing number n c for knots 3i (n c = 3), 4i (n c = 
4), 5i and 52 (n c = 5), 3i#3i, 6i, 62, and 63 (n c = 6), and for 
all knots with n c = 7. (Large circles denote averages for each 
n c ). <5fc's are grouped in bands corresponding to n c 's, i.e. 
to a good approximation they depend only on the number 
of essential crossings of a knot type. A fit is also shown, 
suggesting that 5k — 5$ grows as a power of n c . 



ot$ + 11 k, where itu is the number of prime components 
of the knot. This can be explained by taking into ac- 
count that each such component is localized along the 
chain , [ll[ , bringing an entropic factor ~ N to the 
partition function. On the other hand, the convergence 
of the relative frequencies of knots to definite values rules 
out this picture for collapsed polymers (evfc = a®). 

In summary, we have shown that a great deal of infor- 
mation can be gained from a numerical investigation of 
knots in collapsed polymers. The study of ratios of knot 
frequencies and the established Zipf type of law for the 
ranking in frequency of the knots, allow to draw very solid 
conclusions concerning the universality with respect to 
topology of several statistical parameters characterizing 
the globules. We find that the only non-universal parts of 
the knot frequencies relative to a reference frequency (the 
one of the unknotted rings) are an asymptotic (N — > 00) 
knot-dependent amplitude ratio and the rate of conver- 
gence to this ratio, which seems to be determined only by 
the crossing number of the knots. These results are con- 
sistent with the expectation that knots are delocalized in 
the collapsed regime [9] . The Zipf law also enables us to 
predict how the spectrum of different topologies grows in 
amplitude with growing ring length. 
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